Audio coding



This invention relates to the coding of a multi-channel audio signal and, more 
particularly, to the coding of a multi-channel audio signal which includes at least a first 
signal component, a second signal component and a third signal component.



Parametric descriptions of audio signals have gained interest in recent
years, especially in the field of audio coding. It has been shown that transmitting (quantized)
parameters that describe audio signals requires only little transmission capacity and that they
allow a decoding at the receiving end which results in an audio signal that perceptually does
not significantly differ from the original signal.

European patent application EP 1 107 232 discloses a parametric coding
scheme for a stereo signal comprising a left (L) and a right (R) channel signal. The coding
scheme generates a representation of the stereo signal which includes information concerning
only one of the L and R signals and parametric information based on which, together with the
above information concerning one of the L and R signals, the other signal can be recovered.

However, the above prior art document is not concerned with the problem of 
efficiently coding multi-channel signals which comprise more than two channels. 



The above and other problems are solved by a method of encoding a multi- 
channel audio signal including at least a first signal component, a second signal component 
and a third signal component, the method comprising: 

encoding the first and second signal components by a first parametric encoder 
resulting in a first encoded signal and a first set of encoding parameters; 

encoding the first encoded signal and a further signal by a second parametric
encoder, resulting in a second encoded signal and a second set of encoding parameters, where
the further signal is derived from at least the third signal component; and

representing the multi-channel audio signal at least by a resulting encoded
signal derived from at least the second encoded signal, by the first set of encoding parameters 
and by the second set of encoding parameters. 

Hence, by cascading a plurality of parametric coders, such as stereo coders, an
efficient coding scheme for multi-channel audio signals is provided. According to the
cascading scheme, the output of a first parametric encoding step is fed as an input to a
subsequent second encoding step together with a further input signal, e.g. the output of
another second parametric encoding step.

Consequently, according to the invention, a multi-channel signal with n>2
audio channels may be encoded as a single encoded signal channel and a number of encoding
parameter bit streams corresponding to the parametric encoders, thereby providing a high
coding efficiency.

In a preferred embodiment, the multi-channel audio signal further comprises a
fourth signal component; the method further comprises encoding the third and fourth signal
components by a third parametric encoder resulting in the further signal and a third set of
encoding parameters; and the step of representing the multi-channel audio signal comprises
the step of representing the multi-channel audio signal at least by the resulting encoded signal
derived from at least the second encoded signal, by the first set of encoding parameters, by
the second set of encoding parameters, and by the third set of encoding parameters. Hence,
the further input signal to the second parametric encoder is also an output of a previous
encoder.

The term parametric encoder refers to an encoder for encoding at least two
audio channels resulting in a single encoded audio channel and a set of encoding parameters
that allow a decoder to decode the encoded audio channel into two decoded audio channels.
Examples of such parametric coding schemes comprise a coding of a stereo signal as a
principal component signal and a corresponding rotation angle, a coding of a stereo signal
into a combination signal and a number of parameters corresponding to the spatial attributes
of the stereo signal, etc. However, any known suitable parametric encoding scheme may be
used. The first and second parametric encoding modules may implement the same or
different parametric encoding schemes.

The resulting encoded signal may be derived from the second encoded signal
alone, i.e. it may be identical to or a result of a transformation of the second encoded signal.
Alternatively, the resulting encoded signal may be derived from a combination of the second
encoded signal and another signal. For example, the second encoded signal may serve as an
input to a further encoding module corresponding to a further cascading stage.

Within the field of audio coding, the coding of four-channel signals
comprising a left-front channel, a left-rear channel, a right-front channel, and a right-rear
channel is particularly relevant. According to the invention, such a signal may be
efficiently encoded by a cascaded chain of three parametric encoders: A first encoder
encodes the left-front and the left-rear channel resulting in a combined left channel and the
corresponding encoding parameters. A second encoder encodes the right-front and the right-
rear channel resulting in a combined right channel and the corresponding encoding
parameters. The third encoder receives the combined right channel and the combined left
channel and generates a single encoded signal and a corresponding third set of encoding
parameters.

Furthermore, the emerging technologies of Digital Versatile Disc (DVD) and
Super Audio Compact Disc (SACD) comprise five audio channels: The four channels
mentioned above and an additional center channel. According to the invention, such a signal
may efficiently be encoded by using four parametric encoders: Three encoders encode the
left and right channels as in the four-channel case above, and the fourth encoder
receives the output signal of the above cascaded chain and the center signal as inputs and
generates a final encoded signal.

In another preferred embodiment, the multi-channel signal comprises a five-
channel audio signal, the first signal component includes a left-front channel of the five-
channel audio signal, the second signal component includes a left-rear channel of the five-
channel audio signal, the third signal component includes a right-front channel of the five-
channel audio signal; the fourth signal component includes a right-rear channel of the five-
channel audio signal; the five-channel audio signal further includes a center signal; and the
step of encoding the first encoded signal and a further signal further comprises combining
each of the first encoded signal and the further signal with the center signal. Hence,
according to this embodiment, the center signal is combined with the encoded left channel
and with the encoded right channel, before encoding the left and right channel as a final
encoded signal.

It is a further advantage of this embodiment that it provides an efficient
encoding of a five-channel signal with only three stereo encoders.

It is a further advantage of the invention that it provides a coding scheme
which allows a decoder at the receiving end to adapt to the number of reproduction channels
that are available at the receiving end.

The present invention can be implemented in different ways including the
method described above and in the following, arrangements for encoding and decoding, and
further product means, each yielding one or more of the benefits and advantages described in
connection with the first-mentioned method, and each having one or more preferred
embodiments corresponding to the preferred embodiments described in connection with the
first-mentioned method and disclosed in the dependent claims.

It is noted that the features of the method described above and in the following
may be implemented in software and carried out in a data processing system or other
processing means caused by the execution of computer-executable instructions. The
instructions may be program code means loaded in a memory, such as a RAM, from a storage
medium or from another computer via a computer network. Alternatively, the described
features may be implemented by hardwired circuitry instead of software or in combination
with software.

The invention further relates to a method of decoding an encoded multi- 
channel audio signal, the method comprising: 

obtaining a first encoded signal, a first set of encoding parameters, and a
second set of encoding parameters from the encoded multi-channel audio signal;

obtaining first and second decoded signals from the first encoded signal and
the first set of encoding parameters, the second decoded signal representing at least a first
signal component of the multi-channel signal; and

obtaining third and fourth decoded signals from the first decoded signal and
the second set of encoding parameters.

The invention further relates to an arrangement for encoding a multi-channel
audio signal including at least a first signal component, a second signal component and a
third signal component, the arrangement comprising:

a first parametric encoder adapted to encode the first and second signal
components resulting in a first encoded signal and a first set of encoding parameters;

a second parametric encoder adapted to encode the first encoded signal and a
further signal, resulting in a second encoded signal and a second set of encoding parameters,
where the further signal is derived from at least the third signal component.

The invention further relates to an arrangement for decoding an encoded
multi-channel audio signal, the arrangement comprising:

means for obtaining a first encoded signal, a first set of encoding parameters,
and a second set of encoding parameters from the encoded multi-channel audio signal;

a first decoder adapted to obtain first and second decoded signals from the first
encoded signal and the first set of encoding parameters, the second decoded signal
representing at least a first signal component of the multi-channel signal; and

a second decoder adapted to obtain third and fourth decoded signals from the
first decoded signal and the second set of encoding parameters.

The invention further relates to an apparatus for supplying an encoded audio
signal, the apparatus comprising:

a unit for receiving a multi-channel audio signal;

an arrangement for encoding as described above and in the following for
encoding the multi-channel audio signal; and

an output unit for providing the encoded audio signal.

The invention further relates to an apparatus for supplying a decoded audio
signal, the apparatus comprising:

an input unit for receiving an encoded audio signal; 

an arrangement for decoding as described above and in the following for
decoding the encoded audio signal; and

an output unit for providing the decoded audio signal.

The invention further relates to an encoded multi-channel audio signal 
including an audio signal and first and second sets of parameters, where the audio signal and 
the first set of parameters are generated by a first parametric encoder upon input of a first 
encoded signal and a further signal, where the first encoded signal and the second set of
parameters are generated by a second parametric encoder upon input of a first and second
signal component of a multi-channel signal, and where the further signal is derived from at
least a third signal component of the multi-channel signal.

The invention further relates to a storage medium having stored thereon such
an encoded audio signal.



These and other aspects of the invention will be apparent from and elucidated by the
embodiments described in the following with reference to the drawing, in which:

fig. 1 shows a schematic view of a system for communicating multi-channel
audio signals according to an embodiment of the invention;

fig. 2 shows a block diagram of an encoder for encoding a four-channel audio
signal according to an embodiment of the invention;

fig. 3 shows a block diagram of a decoder for decoding an encoded four-
channel audio signal according to an embodiment of the invention;

fig. 4 shows a block diagram of an encoder for encoding a five-channel audio
signal according to an embodiment of the invention;

fig. 5 shows a block diagram of a decoder for decoding an encoded five-
channel audio signal according to an embodiment of the invention;

fig. 6 schematically illustrates a first example of an encoding module;

fig. 7 schematically illustrates a second example of an encoding module;

fig. 8 shows a block diagram of an encoder for encoding a five-channel audio
signal according to an embodiment of the invention;

fig. 9 shows a block diagram of a decoder for decoding an encoded five-
channel audio signal according to an embodiment of the invention;

fig. 10 shows a block diagram of the decoder 901 of fig. 9 according to an
embodiment of the invention; and

fig. 11 schematically illustrates examples of functional forms of the three
functions used to determine the weighting factors in the embodiment of fig. 10.

Fig. 1 shows a schematic view of a system for communicating multi-channel
audio signals according to an embodiment of the invention. The system comprises a coding
device 101 for generating a coded four-channel signal and a decoding device 105 for
decoding a received coded signal into a four-channel signal. The coding device 101 and the
decoding device 105 each may be any electronic equipment or part of such equipment.

Here, the term electronic equipment comprises computers, such as stationary
and portable PCs, stationary and portable radio communication equipment and other
handheld or portable devices, such as mobile telephones, pagers, audio players, multimedia
players, communicators, i.e. electronic organizers, smart phones, personal digital assistants
(PDAs), handheld computers, or the like. It is noted that the coding device 101 and the
decoding device 105 may be combined in one electronic equipment where audio signals are
stored on a computer-readable medium for later reproduction.
10.07.2002
The coding device 101 comprises an input unit 111 for receiving a multi-
channel signal, an encoder 102 for encoding a four-channel audio signal, the four-channel
signal including a left-front signal component LF, a left-rear signal component LR, a right-
front signal component RF, and a right-rear signal component RR. The encoder 102 receives
the four signal components via the input unit 111 and generates a coded signal T. The four-
channel signal may originate from a set of microphones, e.g. via further electronic
equipment, such as a mixing equipment, etc. The signals may further be received as an output
from another audio player, over-the-air as a radio signal, or by any other suitable means.
Preferred embodiments of such an encoder according to the invention will be described
below.

According to one embodiment, the encoder 102 is connected to a transmitter
103 for transmitting the coded signal T via a communications channel 109 to the decoding
device 105. The transmitter 103 may comprise circuitry suitable for enabling the
communication of data, e.g. via a wired or a wireless data link 109. Examples of such a
transmitter include a network interface, a network card, a radio transmitter, a transmitter for
other suitable electromagnetic signals, such as an LED for transmitting infrared light, e.g. via
an IrDA port, radio-based communications, e.g. via a Bluetooth transceiver, or the like.
Further examples of suitable transmitters include a cable modem, a telephone modem, an
Integrated Services Digital Network (ISDN) adapter, a Digital Subscriber Line (DSL)
adapter, a satellite transceiver, an Ethernet adapter, or the like. Correspondingly, the
communications channel 109 may be any suitable wired or wireless data link, for example of
a packet-based communications network, such as the Internet or another TCP/IP network, a
short-range communications link, such as an infrared link, a Bluetooth connection or another
radio-based link.

Further examples of the communications channel include computer networks
and wireless telecommunications networks, such as a Cellular Digital Packet Data (CDPD)
network, a Global System for Mobile Communications (GSM) network, a Code Division
Multiple Access (CDMA) network, a Time Division Multiple Access (TDMA) network, a
General Packet Radio Service (GPRS) network, a Third Generation network, such as a UMTS
network, or the like.

Alternatively or additionally, the coding device may comprise one or more
other interfaces 104 for communicating the coded signal T to the decoding device 105.
Examples of such interfaces include a disc drive for storing data on a computer-readable
medium 110, e.g. a floppy-disk drive, a read/write CD-ROM drive, a DVD-drive, etc. Other
examples include a memory card slot, a magnetic card reader/writer, an interface for
accessing a smart card, etc.

Correspondingly, the decoding device 105 comprises a corresponding receiver
108 for receiving the signal transmitted by the transmitter and/or another interface 106 for
receiving the coded signal communicated via the interface 104 and the computer-readable
medium 110. The decoding device further comprises a decoder 107 which receives the
received signal T and decodes it into corresponding components LF', LR', RF', and RR' of a
decoded four-channel signal. Preferred embodiments of such a decoder according to the
invention will be described below. The decoding device further comprises an output unit 112
for outputting the decoded signals which may subsequently be fed into an audio player for
reproduction via a set of four speakers, or the like.

Fig. 2 shows a block diagram of an encoder for encoding a four-channel audio
signal according to an embodiment of the invention. The encoder receives a four-channel
audio signal as an input, where the four input channels to be encoded are designated left-front
(LF), right-front (RF), left-rear (LR), and right-rear (RR), corresponding to the respective
speakers of a four-channel audio system. The encoder comprises parametric encoding
modules 201, 202, and 203. The encoding module 202 forms a single audio channel L from
both left-side speaker signals LF and LR combined with a corresponding parameter bit
stream P2. Similarly, the encoding module 203 forms a single audio channel R from both right-
side speaker signals RF and RR combined with a corresponding parameter bit stream P3.

Subsequently, the encoding module 201 generates one broadband audio signal
T from the total-left and total-right signals L and R, respectively. Furthermore, this merging
process results in a third parameter bit stream P1 that describes the spatial properties between
the total-left and total-right channels.

The encoder further comprises a combiner circuit 206 performing a proper
encoding of the signal T, for example according to MPEG, e.g. MPEG-1 Layer 3 (MP3),
according to sinusoidal coding (SSC), or another suitable coding scheme or a combination
thereof. The combiner circuit 206 further performs framing, bit-rate allocation, and lossless
coding resulting in a combined signal 207 to be communicated. Alternatively, the combiner
circuit 206 may supply the audio signal T and the bit streams as two or more separate signals,
as a multiplexed signal, or the like.

Hence, the encoder of fig. 2 generates an output signal including one
broadband audio signal T and three parameter bit streams P1, P2, and P3 to be communicated
to a receiver and/or stored on a storage medium and/or the like. It is noted that, even though
the example of fig. 2 uses 4 audio channels, a similar approach can be used using a different
number of audio channels.
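
The cascade of fig. 2 can be sketched in a few lines of Python. The sketch is illustrative only and rests on assumptions that are not part of the description: the helper encode_pair is a hypothetical toy parametric encoder that sums its two inputs and keeps a single parameter (the share of the pair's energy carried by the first input), whereas a practical module would use a scheme such as those of figs. 6 and 7 and would produce parameters per time/frequency slot. The function name encode_four_channel is likewise chosen for illustration.

```python
def encode_pair(a, b):
    """Toy parametric encoder (hypothetical, for illustration only).

    Returns a mono downmix of the two inputs and one parameter: the share
    of the pair's total energy that is carried by the first input.
    """
    mono = [x + y for x, y in zip(a, b)]
    energy_a = sum(x * x for x in a)
    energy_b = sum(y * y for y in b)
    total = energy_a + energy_b
    share = energy_a / total if total > 0 else 0.5
    return mono, share


def encode_four_channel(lf, lr, rf, rr):
    """Cascade of fig. 2: modules 202 and 203 feed module 201."""
    l, p2 = encode_pair(lf, lr)   # module 202: LF, LR -> total-left L and P2
    r, p3 = encode_pair(rf, rr)   # module 203: RF, RR -> total-right R and P3
    t, p1 = encode_pair(l, r)     # module 201: L, R -> single signal T and P1
    return t, (p1, p2, p3)
```

Whatever pair encoder is substituted for the toy helper, the structure stays the same: one audio signal T and three parameter sets leave the cascade.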

It is understood that, alternatively, the encoder 202 may encode the signals LR
and RR to generate a total rear signal while the encoder 203 may encode the signals LF and
RF to generate a total front signal. Subsequently, the total front and total rear signals are
combined by a further encoder. The parameters generated by that encoder may then be used
for a 2D parameter representation, i.e. the parameters from this encoder may be used as
overall parameters to decode front from rear channels for both left and right channels.

Fig. 3 shows a block diagram of a decoder for decoding an encoded four-channel audio signal
according to an embodiment of the invention. The decoder comprises a circuit 306 for
extracting the encoded signal T and the parameter streams P1, P2, and P3 from the received
signal 307, i.e. the circuit 306 performs an inverse operation of the combiner 206 of fig. 2.

The decoder further comprises parametric decoding modules 301, 302, and
303 corresponding to the encoding modules 201, 202, and 203, respectively. The cascaded
encoding process described in connection with fig. 2 is reversed in the decoder: The decoder
receives a broadband audio signal T and three parameter bit streams P1, P2, and P3. First, the
decoding module 301 synthesizes the total-left and total-right signals L and R, respectively,
from the single incoming audio signal T using the appropriate parameters P1. If the current
end-user has only two loudspeakers, the decoding process ends here.

If the end-user has 4 loudspeakers, an additional decoding step is performed:
Decoder 302 receives the total-left signal L and the parameter bit stream P2 and synthesizes
from it the left-front and left-rear signals LF and LR, respectively.

Similarly, decoder 303 receives the total-right signal R and the parameter bit
stream P3 and synthesizes from it the right-front and right-rear signals RF and RR,
respectively.

In one embodiment, the same parameters may be used for decoders 302 and
303, thereby further reducing the bandwidth required for transmitting the multi-channel
signal, as only one of the parameter bit streams P2 and P3 (or a combination thereof) needs to
be transmitted from the encoder to the decoder. In this embodiment, the parameters P1 that
are fed into decoder 301 determine the left-right spatial sound image, while the parameters
that enter decoders 302 and 303 determine the front-back spatial image.
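
The way the decoder of fig. 3 adapts to the number of loudspeakers may be sketched as follows. This is a minimal illustration under stated assumptions: decode_pair is a hypothetical toy parametric decoder that splits a mono signal into two scaled copies according to a single energy-share parameter, and the argument num_speakers and the dictionary return format are chosen for the example only.

```python
import math


def decode_pair(mono, share):
    """Toy parametric decoder (hypothetical, for illustration only).

    Splits a mono signal into two scaled copies whose energies are in the
    ratio share : (1 - share).
    """
    gain_a = math.sqrt(share)
    gain_b = math.sqrt(1.0 - share)
    return [gain_a * x for x in mono], [gain_b * x for x in mono]


def decode(t, p1, p2, p3, num_speakers):
    """Reverse cascade of fig. 3, stopping early for a two-speaker set-up."""
    l, r = decode_pair(t, p1)        # module 301: T, P1 -> L, R
    if num_speakers == 2:
        return {"L": l, "R": r}      # stereo reproduction: decoding ends here
    lf, lr = decode_pair(l, p2)      # module 302: L, P2 -> LF, LR
    rf, rr = decode_pair(r, p3)      # module 303: R, P3 -> RF, RR
    return {"LF": lf, "LR": lr, "RF": rf, "RR": rr}
```

Passing the same value for p2 and p3 corresponds to the embodiment in which decoders 302 and 303 share one set of front-back parameters.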

Fig. 4 shows a block diagram of an encoder for encoding a five-channel audio
signal according to an embodiment of the invention. The encoder comprises encoding
modules 401, 402, 403, and 404. The encoder receives a five-channel audio signal as an
input, where the five input channels to be encoded are designated left-front (LF), right-front
(RF), left-rear (LR), right-rear (RR), and center (C), corresponding to the respective
speakers of a five-channel audio system.

The encoding modules 402 and 403 generate the total-left and total-right
signals L and R, respectively, and corresponding bit streams P2 and P3, respectively, from
the corresponding input signals LF, LR and RF, RR, respectively.

Subsequently, the encoding module 401 generates an audio signal S and
corresponding bit stream P1 from the total-left and total-right signals L and R, respectively.
Hence, the encoding modules 401, 402, and 403 correspond to the encoding modules 201,
202, and 203 of fig. 2.

The encoder of fig. 4 includes an additional cascading stage comprising the
encoding module 404 which receives the output signal S of encoder 401 and the center signal
C. The encoding module 404 generates a broadband audio signal T and a parameter bit
stream P4 representing the mid-side characteristic of the audio signal.

The encoder further comprises a combiner circuit 406 generating an output
signal 407, as described in connection with circuit 206 in fig. 2. Hence, the encoder of fig. 4
generates an output signal 407 including one broadband audio signal T and four parameter bit
streams P1, P2, P3, and P4 to be communicated to a receiver and/or stored on a storage
medium and/or the like.

Fig. 5 shows a block diagram of a decoder for decoding an encoded five-
channel audio signal according to an embodiment of the invention. The decoder comprises a
circuit 506 for extracting the encoded signal T and the parameter streams P1, P2, P3, and P4
from the received signal 507, i.e. the circuit 506 performs an inverse operation of the
combiner 406 of fig. 4.

The decoder further comprises parametric decoding modules 501, 502, 503,
and 504 corresponding to the encoding modules 401, 402, 403, and 404, respectively. The
cascaded encoding process described in connection with fig. 4 is reversed in the decoder: The
decoder receives a broadband audio signal T and four parameter bit streams P1, P2, P3, and
P4. First, the decoding module 504 synthesizes the total side signal S and the center signal C
using the parameters P4.

Subsequently, the decoders 501, 502, and 503 synthesize the left-front, left-
rear, right-front, and right-rear signals LF, LR, RF, and RR, respectively, from the total side
signal S and the parameter bit streams P1, P2, and P3, as was described in connection with
the decoder of fig. 3.

It is understood that, alternatively, a five-channel audio transmission may be
achieved by transmitting two audio channels combined with three parameter bit streams, e.g.
by transmitting an encoded four-channel signal as described in connection with figs. 2 and 3
and one additional mono channel.

Fig. 6 schematically illustrates a first example of a parametric encoding
module. The arrangement receives an audio signal having two signal components L and R.
For example, these signal components may be two of the incoming signal components of a
multi-channel signal, such as the LF and LR signal components or the RF and RR signal
components of a four-channel signal, or the encoded total-left and total-right signals
generated by the encoders 402 and 403, respectively, in fig. 4. The parametric encoding
module comprises circuitry 601 for performing a rotation of the incoming signal in the L-R
space by an angle α, resulting in rotated signal components y and r according to the
transformation

$$y = L\cos\alpha + R\sin\alpha = w_L L + w_R R$$
$$r = -L\sin\alpha + R\cos\alpha = -w_R L + w_L R,$$

where $w_L = \cos\alpha$ and $w_R = \sin\alpha$ will be referred to as weighting factors.

Preferably, the angle α is determined such that it corresponds to a direction of
high signal variance. The direction of maximum signal variance, i.e. the principal component,
may be estimated by a principal component analysis such that the rotated y component
corresponds to the principal component signal which includes most of the signal energy, and
r is a residual signal. Correspondingly, the encoding module of fig. 6 further comprises
circuitry 602 which determines the angle α or, alternatively, the weighting factors w_L and w_R,
for example by performing a principal component analysis (PCA) of the incoming signal.
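
The rotation and the choice of the angle may be illustrated by the following sketch. It is not taken from the description: it assumes zero-mean input blocks, uses the closed-form solution of a two-dimensional principal component analysis in place of a general PCA routine, and the function names principal_angle and rotate are chosen for the example only.

```python
import math


def principal_angle(l, r):
    """Angle of maximum variance in the L-R plane.

    Closed-form 2-D PCA on the second moments of the block; the inputs are
    assumed to be zero-mean (an assumption of this sketch).
    """
    s_ll = sum(x * x for x in l)
    s_rr = sum(y * y for y in r)
    s_lr = sum(x * y for x, y in zip(l, r))
    return 0.5 * math.atan2(2.0 * s_lr, s_ll - s_rr)


def rotate(l, r, alpha):
    """Apply y = L cos(a) + R sin(a) and r = -L sin(a) + R cos(a)."""
    w_l, w_r = math.cos(alpha), math.sin(alpha)   # weighting factors
    y = [w_l * a + w_r * b for a, b in zip(l, r)]
    residual = [-w_r * a + w_l * b for a, b in zip(l, r)]
    return y, residual
```

For two identical input channels the angle evaluates to 45 degrees and the residual vanishes, so that all signal energy ends up in the principal component y.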



In one embodiment, the encoding module of fig. 6 outputs the principal
component signal y and the rotation parameter α or one of w_L and w_R. In another
embodiment, the parametric encoder may determine filter parameters of an adaptive linear
filter such that the adaptive filter generates an estimate of the residual signal r when the
principal component signal y is fed into the filter as an input. According to this embodiment,
the incoming signal is encoded as the principal component signal y, a rotation parameter, and
a set of filter parameters, thereby allowing a decoder at the receiver to predict the residual
signal r from the received principal component signal y, and to rotate the signal back into the
L and R direction (see e.g. European patent application no. 02076410.6, filed on 10 April
2002).

Fig. 7 schematically illustrates a second example of an encoding module. The
encoding module of fig. 7 describes the spatial attributes of a multi-channel audio signal by
specifying an interaural level difference, an interaural time (or phase) difference, and a
maximum correlation as a function of time and frequency, as is described in European patent
application no. 02076588.9, filed on 22 April 2002. The encoding module receives the L and
R components of a stereo signal as inputs. Initially, by time/frequency slicing circuits 702
and 703, the R and L components, respectively, are split up into several time/frequency slots,
e.g. by time-windowing followed by a transform operation.

Subsequently, in the analysis circuit 704, for every time/frequency slot, the
following properties of the incoming signals are analyzed:

The interaural level difference, or ILD, defined by the relative levels of the
corresponding band-limited signals stemming from the two inputs,

The interaural time (or phase) difference (ITD or IPD), defined by the
interaural delay (or phase shift) corresponding to the peak in the interaural cross-correlation
function, and

The (dis)similarity of the waveforms that cannot be accounted for by ITDs or
ILDs, which can be parameterized by the maximum value of the cross-correlation function
(i.e., the value of the cross-correlation function at the position of the maximum peak).

The three parameters described above vary over time; however, since it is 
known that the binaural auditory system is very sluggish in its processing, the update rate of 
these properties is rather low (typically tens of milliseconds). 
The analysis circuit 704 further generates a sum (or dominant) signal S
comprising a combination of the left and right signals. Hence, the L and R signals are
encoded as the sum signal S and a set of parameters P as a function of frequency and time,
the parameters P comprising the ILD, the ITD/IPD, and the maximum value of the cross-
correlation function.
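
The three properties may be estimated for a single time/frequency slot as sketched below. The sketch is an illustration under assumptions not stated in the description: the two inputs are already band-limited blocks of equal length with non-zero energy, the ILD is expressed in decibels as left relative to right, the ITD is returned as a lag in samples (a positive lag meaning that the right signal is delayed relative to the left), and the function name spatial_parameters and the argument max_lag are chosen for the example only.

```python
import math


def spatial_parameters(left, right, max_lag):
    """Return (ILD in dB, lag of the correlation peak, peak value) for one
    band-limited time/frequency slot. Both inputs must have non-zero energy."""
    energy_l = sum(x * x for x in left)
    energy_r = sum(x * x for x in right)
    ild_db = 10.0 * math.log10(energy_l / energy_r)

    norm = math.sqrt(energy_l * energy_r)
    n = len(left)
    best_lag, best_rho = 0, -1.0
    for lag in range(-max_lag, max_lag + 1):
        # Correlate left[i] with right[i + lag] over the overlapping samples.
        acc = sum(left[i] * right[i + lag]
                  for i in range(max(0, -lag), min(n, n - lag)))
        rho = acc / norm                 # normalized cross-correlation
        if rho > best_rho:
            best_lag, best_rho = lag, rho
    return ild_db, best_lag, best_rho
```

Because the parameters only need to be updated every few tens of milliseconds, such an analysis would be run once per slot rather than per sample.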

Fig. 8 shows a block diagram of an encoder for encoding a five-channel audio
signal according to an embodiment of the invention. The encoder comprises encoding
modules 801, 802, and 803. The encoder receives a five-channel audio signal as an input,
where the five input channels to be encoded are designated left-front (LF), right-front (RF),
left-rear (LR), right-rear (RR), and center (C), corresponding to the respective speakers of a
five-channel audio system.

The encoding modules 802 and 803 generate the total-left and total-right
signals L and R, respectively, and corresponding bit streams P2 and P3, respectively, from
the corresponding input signals LF, LR and RF, RR, respectively.

Subsequently, the encoding module 801 generates an audio signal T and
corresponding bit stream P1 from the total-left and total-right signals received from the
encoding modules 802 and 803, respectively. Hence, the encoding modules 801, 802, and
803 correspond to the encoding modules 201, 202, and 203 of fig. 2.
However, in contrast to the previous embodiment, the side signal C is
combined with both the total-left and total-right signals L and R generated by the encoders
802 and 803, respectively. The encoder of fig. 8 comprises summing circuits 804 for adding
the side signal to each of the total-left and total-right signals L and R, resulting in combined
signals L' and R', respectively, which are fed into the encoding module 801. The encoder
further comprises a combiner circuit 806 for generating the final output signal 807 as
described in connection with circuit 206 in fig. 2.
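The signal flow of fig. 8 may be sketched as follows. The sketch only illustrates the cascade of modules; `toy_parametric_encode` is a hypothetical stand-in (an averaged sum signal plus two channel energies) and does not represent the parametric encoder of fig. 7, and the dictionary returned by `encode_five_channel` is merely an assumed placeholder for the output of the combiner circuit 806.

```python
def toy_parametric_encode(a, b):
    """Hypothetical two-to-one encoder: returns (sum signal, parameters)."""
    s = [0.5 * (x + y) for x, y in zip(a, b)]
    params = {"energy_a": sum(x * x for x in a),
              "energy_b": sum(y * y for y in b)}
    return s, params

def encode_five_channel(lf, lr, rf, rr, c):
    """Cascade of fig. 8: (LF, LR) -> L, (RF, RR) -> R, (L + C, R + C) -> T."""
    l, p2 = toy_parametric_encode(lf, lr)             # encoding module 802
    r, p3 = toy_parametric_encode(rf, rr)             # encoding module 803
    l_prime = [x + y for x, y in zip(l, c)]           # summing circuit 804
    r_prime = [x + y for x, y in zip(r, c)]           # summing circuit 804
    t, p1 = toy_parametric_encode(l_prime, r_prime)   # encoding module 801
    return {"T": t, "P1": p1, "P2": p2, "P3": p3}     # stands in for combiner 806
```

Only one audio signal T leaves the cascade; the other four channels are represented by the parameter sets P1, P2 and P3.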

It is an advantage of this embodiment that it provides a more cost-effective 
method to code five-channel audio. 

Fig. 9 shows a block diagram of a decoder for decoding an encoded five-channel
audio signal according to an embodiment of the invention. The decoder of fig. 9 is
suitable for decoding a signal encoded by the encoder of fig. 8. The decoder comprises a
circuit 906 for extracting the encoded signal T and the parameter streams P1, P2, and P3 from
the received signal 907, i.e. the circuit 906 performs an inverse operation of the combiner 806
of fig. 8.

The decoder further comprises decoding modules 901, 902, and 903. The
decoding module 901 receives the encoded audio signal T and the corresponding set of
parameters P1. Initially, the decoding module 901 analyses the transmitted parameters P1. If
the parameters P1 indicate that the signal is a mono signal, the decoder outputs the received
signal as a side signal. Hence, in this case, the signal is fed to a side speaker and no signal is
fed to the left and right channel outputs L and R of decoder 901.

If the transmitted parameters P1 indicate that the signal is stereo, the signal is
decoded by distributing it to the left and right outputs.

The method used for detecting mono or stereo content depends on the exact
coder structure and parameter bit stream. For example, in one embodiment using the
parametric encoding of spatial stereo described in connection with fig. 7, the ITD, ILD and
correlation parameters determine the spatial signal properties as a function of frequency.
Hence, for each frequency band, the corresponding band-limited signal is fed to the center
speaker if the ITD and ILD are close to zero, e.g. smaller than a predetermined constant, and
if the correlation is close to +1, i.e. if the difference of 1 minus the correlation is smaller than
a predetermined constant, e.g. smaller than 0.1. For example, the predetermined constant for
the ITD may be chosen to be of the order of 50-100 microseconds, and for the ILD the
predetermined constant may be chosen to be e.g. 1 to 3 dB. For all other values of the parameters,
the signal is distributed over the left and right outputs. A preferred embodiment of the
decoding module 901 will be described in connection with fig. 10.
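As an illustration only, the per-band decision described above may be sketched as follows. The function name and the default constants (2 dB, 75 microseconds, 0.1) are assumptions chosen within the example ranges given above, and the comparison of the magnitudes of the ILD and ITD against the constants is likewise an assumption of this sketch.

```python
def band_goes_to_center(ild_db, itd_us, rho,
                        ild_max_db=2.0, itd_max_us=75.0, rho_margin=0.1):
    """Return True if a band should be fed to the center speaker only.

    The band counts as mono when |ILD| and |ITD| are below their
    (assumed) constants and 1 - rho is below rho_margin.
    """
    return (abs(ild_db) < ild_max_db
            and abs(itd_us) < itd_max_us
            and (1.0 - rho) < rho_margin)
```

For example, a band with an ILD of 0.5 dB, an ITD of 20 microseconds and a correlation of 0.97 would be routed to the center speaker, whereas the same band with an ILD of 4 dB would be distributed over the left and right outputs.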

The decoding modules 902 and 903 decode the total-right and total-left signals 
as described above, resulting in the left-front, left-rear, right-front, and right-rear signal 
components LF, LR, RF, and RR, respectively. 

Fig. 10 shows a block diagram of the decoder 901 of fig. 9 according to an
embodiment of the invention. The decoding module 901 receives the encoded audio signal T
and the corresponding set of parameters P1. The general idea behind the decoding module
901 is to feed (a specific frequency band of) the input signal to the center speaker only if the
spatial parameters indicate that the output signals are mono (which means ILD = 0, ITD = 0 and
correlation = 1). For other values of the spatial parameters, the signal should be sent to the
left and right outputs using the parametric decoder.

However, it is more desirable to achieve a smooth transition between a
distribution to the center output and the left and right outputs depending on the spatial
parameters. Consequently, the decoding module comprises circuitry 1002 which receives the
parameters P1 and computes weighting functions $w_c$ and $w_{lr}$. Here, $w_c$ denotes the relative
amount of the mono input signal that is to be sent to the center output, while $w_{lr}$ denotes the
relative amount of the input signal that is to be decoded according to the spatial parameters
and sent to the left and right output pair. In one embodiment, the relation between the weights
is set by the following constraint:

$$w_c^n + w_{lr}^n = 1$$



Here, n denotes a power which indicates whether the system should preserve
the overall amplitude (n=1), preserve the total amount of power (n=2) or any other overall
signal level measure. Hence, if $w_c$ is known, $w_{lr}$ can be obtained according to the above
equation and vice versa.

The decoding module further comprises circuitry 1003 which divides each
subband of the input signal according to the weight factors $w_c$ and $w_{lr}$ between the center
output C and the input $T_{lr}$ to a parametric decoder 1004. The parametric decoder decodes the
scaled signal $T_{lr}$ as described above, resulting in the total-left and the total-right signals L
and R, respectively.
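A minimal sketch of the weighting constraint and of the division performed by circuitry 1003 is given below. The function names are hypothetical, the center weight is assumed to lie between 0 and 1, and the scaling of each subband sample by the two weights is one assumed reading of "divides each subband of the input signal according to the weight factors".

```python
def lr_weight(w_c, n=2):
    """Solve w_c**n + w_lr**n = 1 for w_lr, assuming 0 <= w_c <= 1."""
    if not 0.0 <= w_c <= 1.0:
        raise ValueError("w_c must lie in [0, 1]")
    return (1.0 - w_c ** n) ** (1.0 / n)

def split_subband(t_band, w_c, n=2):
    """Divide one subband of T between the center output C and T_lr."""
    w_lr = lr_weight(w_c, n)
    center = [w_c * x for x in t_band]   # sent to the center output C
    t_lr = [w_lr * x for x in t_band]    # input to parametric decoder 1004
    return center, t_lr
```

With n=2 the summed power of the two scaled signals equals the power of the input subband, and with n=1 the two weights sum to one, which corresponds to the amplitude-preserving and power-preserving cases named above.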

Preferably, the circuitry 1002 determines the weight $w_c$ such that $w_c = 1$ if the
ILD and ITD of a certain subband equal 0 and if the correlation equals +1. For other values
of the parameters, $w_c$ should decrease towards zero. In one embodiment this behavior is
obtained in the following way: $w_c$ is composed of the product of three functions $P_1$, $P_2$ and
$P_3$. $P_1$ only depends on the ILD value of that subband, $P_2$ only depends on the ITD value of
the current subband, and $P_3$ only depends on the cross-correlation $\rho$ of that subband. Thus:

$$w_c = P_1(\mathrm{ILD}) \cdot P_2(\mathrm{ITD}) \cdot P_3(\rho)$$

Figs. 11a-c schematically illustrate examples of functional forms of the three
functions used to determine the weighting factors in the embodiment of fig. 10.

Preferably, the functional form of the functions $P_1$, $P_2$, and $P_3$ should meet the
following constraints: $P_1$ and $P_2$ have a maximum of +1 for an ILD (respectively ITD) of zero
and decrease towards zero for smaller or larger values. $P_3$ has a maximum of +1 at
correlation +1 and decreases towards zero for lower values. Figs. 11a-c illustrate examples of
functions $P_1$, $P_2$, and $P_3$, respectively, which fulfill the above conditions.
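One possible family of functions meeting these constraints is sketched below. The Gaussian shapes and the width constants (3 dB, 100 microseconds, 0.1) are assumptions made for illustration; they are not the curves of figs. 11a-c.

```python
import math

def p1(ild_db, width_db=3.0):
    """Assumed shape: equals 1 at ILD = 0, decays symmetrically towards 0."""
    return math.exp(-(ild_db / width_db) ** 2)

def p2(itd_us, width_us=100.0):
    """Assumed shape: equals 1 at ITD = 0, decays symmetrically towards 0."""
    return math.exp(-(itd_us / width_us) ** 2)

def p3(rho, width=0.1):
    """Assumed shape: equals 1 at correlation +1, decays for lower values."""
    return math.exp(-((1.0 - rho) / width) ** 2)

def center_weight(ild_db, itd_us, rho):
    """Center weight as the product of the three functions."""
    return p1(ild_db) * p2(itd_us) * p3(rho)
```

Because each factor is at most 1, the product reaches 1 only when all three parameters indicate a mono signal, and any single parameter deviating from that condition is sufficient to pull the center weight towards zero.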

It is noted that alternative methods for distributing the decoded signal T
between the center output C, the left output L, and the right output R may be used. For
example, initially, the signal T may be decoded into an L and an R signal using the
parameters P1, as described above. Subsequently, an algorithm to redistribute two input
signals over three (left, center, right) outputs may be employed. Hence, first the left and right
output signals of the decoder are computed using any known parametric stereo decoder,
followed by a redistribution (matrixing) of signals to the three (left, right and center) outputs.
Such methods are known in the art of 2-to-5 channel processors, as described in international
patent application WO 02/07481.

It is noted that the above arrangements may be implemented as general- or
special-purpose programmable microprocessors, Digital Signal Processors (DSP),
Application Specific Integrated Circuits (ASIC), Programmable Logic Arrays (PLA), Field
Programmable Gate Arrays (FPGA), special purpose electronic circuits, etc., or a
combination thereof.

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative
embodiments without departing from the scope of the appended claims. 

In the claims, any reference signs placed between parentheses shall not be 
construed as limiting the claim. The word "comprising" does not exclude the presence of
elements or steps other than those listed in a claim. The word "a" or "an" preceding an 
element does not exclude the presence of a plurality of such elements. 

The invention can be implemented by means of hardware comprising several
distinct elements, and by means of a suitably programmed computer. In the device claim
enumerating several means, several of these means can be embodied by one and the same
item of hardware. The mere fact that certain measures are recited in mutually different
dependent claims does not indicate that a combination of these measures cannot be used to
advantage.

CLAIMS:



1. A method of encoding a multi-channel audio signal including at least a first
signal component, a second signal component and a third signal component, the method
comprising:

encoding the first and second signal components by a first parametric encoder
resulting in a first encoded signal and a first set of encoding parameters;

encoding the first encoded signal and a further signal by a second parametric
encoder, resulting in a second encoded signal and a second set of encoding parameters, where
the further signal is derived from at least the third signal component; and

representing the multi-channel audio signal at least by a resulting encoded
signal derived from at least the second encoded signal, by the first set of encoding parameters
and by the second set of encoding parameters.

2. A method according to claim 1, wherein the multi-channel audio signal further
comprises a fourth signal component; the method further comprises encoding the third and
fourth signal components by a third parametric encoder resulting in the further signal and a
third set of encoding parameters; and the step of representing the multi-channel audio signal
comprises the step of representing the multi-channel audio signal at least by the resulting
encoded signal derived from at least the second encoded signal, by the first set of encoding
parameters, by the second set of encoding parameters, and by the third set of encoding
parameters.

3. A method according to claim 2, wherein the multi-channel signal comprises a
four-channel audio signal, the first signal component includes a left-front channel of the
four-channel audio signal, the second signal component includes a left-rear channel of the
four-channel audio signal, the third signal component includes a right-front channel of the
four-channel audio signal, and the fourth signal component includes a right-rear channel of the
four-channel audio signal.

4. A method according to claim 2, wherein the multi-channel signal comprises a
five-channel audio signal, the first signal component includes a left-front channel of the
five-channel audio signal, the second signal component includes a left-rear channel of the
five-channel audio signal, the third signal component includes a right-front channel of the
five-channel audio signal, the fourth signal component includes a right-rear channel of the
five-channel audio signal; the five-channel audio signal further includes a center signal; the
method further comprises encoding the second encoded signal and the center signal by a
fourth parametric encoder resulting in a third encoded signal and a fourth set of encoding
parameters; and the step of representing the multi-channel audio signal comprises
representing the multi-channel audio signal at least by the third encoded signal, and by the
first, second, third and fourth sets of encoding parameters.

5. A method according to claim 2, wherein the multi-channel signal comprises a
five-channel audio signal, the first signal component includes a left-front channel of the
five-channel audio signal, the second signal component includes a left-rear channel of the
five-channel audio signal, the third signal component includes a right-front channel of the
five-channel audio signal; the fourth signal component includes a right-rear channel of the
five-channel audio signal; the five-channel audio signal further includes a center signal; and the
step of encoding the first encoded signal and a further signal further comprises combining
each of the first encoded signal and the further signal with the center signal.

6. A method according to claim 2, wherein the multi-channel signal comprises a
five-channel audio signal, the first signal component includes a left-front channel of the
five-channel audio signal, the second signal component includes a left-rear channel of the
five-channel audio signal, the third signal component includes a right-front channel of the
five-channel audio signal, the fourth signal component includes a right-rear channel of the
five-channel audio signal; the five-channel audio signal further includes a center signal; and the
step of representing the multi-channel audio signal comprises the step of representing the
multi-channel audio signal at least by the second encoded signal, the center signal, and by the
first, second, and third sets of encoding parameters.

7. A method of decoding an encoded multi-channel audio signal, the method
comprising:

obtaining a first encoded signal, a first set of encoding parameters, and a
second set of encoding parameters from the encoded multi-channel audio signal;

obtaining first and second decoded signals from the first encoded signal and
the first set of encoding parameters, the second decoded signal representing at least a first
signal component of the multi-channel signal; and

obtaining third and fourth decoded signals from the first decoded signal and
the second set of encoding parameters.




8. An arrangement for encoding a multi-channel audio signal including at least a 
first signal component, a second signal component and a third signal component, the 
arrangement comprising: 

a first parametric encoder adapted to encode the first and second signal 
components resulting in a first encoded signal and a first set of encoding parameters; 

a second parametric encoder adapted to encode the first encoded signal and a 
further signal, resulting in a second encoded signal and a second set of encoding parameters,
where the further signal is derived from at least the third signal component.

9. An arrangement according to claim 8, further comprising means for
representing the multi-channel audio signal at least by a resulting encoded signal derived
from at least the second encoded signal, by the first set of encoding parameters and by the
second set of encoding parameters.




10. An arrangement for decoding an encoded multi-channel audio signal, the
arrangement comprising:

means for obtaining a first encoded signal, a first set of encoding parameters,
and a second set of encoding parameters from the encoded multi-channel audio signal;

a first decoder adapted to obtain first and second decoded signals from the first
encoded signal and the first set of encoding parameters, the second decoded signal
representing at least a first signal component of the multi-channel signal; and

a second decoder adapted to obtain third and fourth decoded signals from the
first decoded signal and the second set of encoding parameters.



11. An apparatus for supplying an encoded audio signal, the apparatus comprising

a unit for receiving a multi-channel audio signal;

an arrangement for encoding as claimed in claim 8 for encoding the multi-channel
audio signal; and

an output unit for providing the encoded audio signal.

12. An apparatus for supplying a decoded audio signal, the apparatus comprising 

an input unit for receiving an encoded audio signal; 
an arrangement for decoding as claimed in claim 10 for decoding the encoded 

audio signal; and 

an output unit for providing the decoded audio signal. 



13. An encoded multi-channel audio signal including an audio signal and first and
second sets of parameters, where the audio signal and the first set of parameters are generated
by a first parametric encoder upon input of a first encoded signal and a further signal, where
the first encoded signal and the second set of parameters are generated by a second
parametric encoder upon input of a first and second signal component of a multi-channel
signal, and where the further signal is derived from at least a third signal component of the
multi-channel signal.



14. A storage medium having stored thereon an encoded audio signal according to
claim 13.

ABSTRACT:



A method of encoding a multi-channel audio signal including at least a first
signal component (LF), a second signal component (LR) and a third signal component (RF).
The method comprises the steps of encoding the first and second signal components by a first
parametric encoder (202) resulting in a first encoded signal (L) and a first set of encoding
parameters (P2); encoding the first encoded signal and a further signal (R) by a second
parametric encoder (201), resulting in a second encoded signal (T) and a second set of
encoding parameters (P1), where the further signal is derived from at least the third signal
component; and representing the multi-channel audio signal at least by a resulting encoded
signal (T) derived from at least the second encoded signal, by the first set of encoding
parameters and by the second set of encoding parameters.